Predicting and Identifying Hypertext in Wikipedia Articles

نویسندگان

  • Neel Guha
  • Cindy Wang
  • Annie Hu
چکیده

1. Ratinov, Roth, Downey, and Anderson. Local and Global Algorithms for Disambiguation to Wikipedia. (University of Illinois at Urbana-Champaign). Retrieved from http://web.eecs.umich.edu/~mrander/pubs/RatinovDoRo.pdf 2. Zhou, Nie, Rouhani-Kalleh, Vasile, and Gaffney. Resolving surface forms to Wikipedia topics. (ACM Digital Library). Retrieved from http://dl.acm.org/citation.cfm?id=1873931 3. Cucerzan. Large-scale Named Entity Disambiguation Based on Wikipedia Data. (Microsoft Research). Retrieved from http://www.aclweb.org/anthology/D07-1074 4. Mihalcea and Csomai. Wikify!: linking documents to encyclopedic knowledge. (ACM Digital Library). Retrieved from http://dl.acm.org/citation.cfm?id=1321475 • Helpful intra-article linking is critical to Wikipedia’s success

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles

When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...

متن کامل

Representation of Hypertext Documents Based on Terms, Links and Text Compressibility

Three methods for representation of hypertext based on links, terms and text compressibility have been compared to check their usefulness in document classification. Documents for classification have been selected from the Wikipedia articles taken from five distinct categories. For each representation dimensionality reduction by Principal Component Analysis has been performed, providing rough v...

متن کامل

Wikipedia as an Ontology for Describing Documents

Identifying topics and concepts associated with a set of documents is a task common to many applications. It can help in the annotation and categorization of documents and be used to model a person's current interests for improving search results, business intelligence or selecting appropriate advertisements. One approach is to associate a document with a set of topics selected from a fixed ont...

متن کامل

Participation and Scientific Collaboration in Persian Wikipedia

Background and Aim: This research studies the effective participation and scientific collaboration in Persian Wikipedia, from 2003-2012.  Method: The library method has been used. Also, considering the objectives and the nature of subject, the research method is a descriptive-applied and during its implementation scientometric technique has been used. Excel and SPSS softwares have been used for...

متن کامل

Neural Joint Learning for Classifying Wikipedia Articles into Fine-grained Named Entity Types

This paper addresses the task of assigning finegrained NE type labels to Wikipedia articles. To address the data sparseness problem, which is salient particularly in fine-grained type classification, we introduce a multi-task learning framework where type classifiers are all jointly learned by a neural network with a hidden layer. In addition, we also propose to learn article vectors (i.e. enti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015